A Survey of Software Fault Tolerance Techniques

نویسنده

  • Jonathan M. Smith
چکیده

This report examines the state of the field of software fault tolerance. Terminology, techniques for building reliable systems, andfault tolerance are discussed. While a scientific consensus on the measurement of software reliability has not been reached, software systems are sufficiently pervasive that' 'software components" of larger systems must be reliable, since dependence is placed on them. Fault tolerant systems utilize redundant components to mitigate the e[fecL~ of component failures, and thus create a system which is more reliable than a single component. This idea can be applied to software systems as well. Several techniques for designing fault tolerant software systems are discussed and assessed qualitatively, where "software fault" refers to what is more commonly known as a bug. The assumptions, relative merits, available experimental results, and implementation experience are discussed for each technique. This leads us to some conclusions about the state of the field.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Survey of Software Fault Tolerance Techniques

The paper surveys various software fault tolerance techniques and methodologies. The techniques include traditional techniques: recovery blocks (RcB), n-version programming, n selfchecking Programming, retry blocks (RtB), n-copy programming and some new techniques: adaptive n-version systems, fuzzy voting, abstraction, parallel graph reduction, rejuvenation. The utility for each technique based...

متن کامل

A Survey of Distributed Fault Tolerance Strategies

Grid computing is defined as geographically distributed, heterogeneity (different hardware, software and networks), resource sharing, multiple administrators, dependable access, and Pervasive access within dynamic organizations. In grid computing, the rate of failure is much greater than in traditional parallel computing. Therefore, the fault tolerance is an important property in order to achie...

متن کامل

A Survey on Software Fault tolerance in Parallel Computing

Software almost inevitably contains defects. Do everything possible to reduce the fault rate; Use faulttolerance techniques to deal with software faults. Fault tolerance is the ability of a system to perform its function correctly even in the presence of internal faults. Most of the ordinary systems lack fault tolerant software fix. This paper surveys various software Fault Tolerance techniques...

متن کامل

Novel Defect Terminolgy Beside Evaluation And Design Fault Tolerant Logic Gates In Quantum-Dot Cellular Automata

Quantum dot Cellular Automata (QCA) is one of the important nano-level technologies for implementation of both combinational and sequential systems. QCA have the potential to achieve low power dissipation and operate high speed at THZ frequencies. However large probability of occurrence fabrication defects in QCA, is a fundamental challenge to use this emerging technology. Because of these vari...

متن کامل

Survey of Software Fault Localization

I present a survey of automated techniques for software fault localization. Fault localization or localizing the root cause of failure is one of the most difficult processes in software debugging. Hence, many automated techniques have emerged to help in this process. Most of these techniques are based on the principles used in real life for fault diagnosis. I have used these principles to class...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004